Nearest-neighbor Queries in Probabilistic Graphs
نویسندگان
چکیده
Large probabilistic graphs arise in various domains spanning from social networks to biological and communication networks. An important query in these graphs is the k nearestneighbor query, which involves finding and reporting the k closest nodes to a specific node. This query assumes the existence of a measure of the “proximity” or the “distance” between any two nodes in the graph. To that end, we propose various novel distance functions that extend well known notions of classical graph theory, such as shortest paths and random walks. We argue that many meaningful distance functions are computationally intractable to compute exactly. Thus, in order to process nearest-neighbor queries, we resort to Monte Carlo sampling and exploit novel graph-transformation ideas and pruning opportunities. In our extensive experimental analysis, we explore the trade-offs of our approximation algorithms and demonstrate that they scale well on real-world probabilistic graphs with tens of millions of edges.
منابع مشابه
k-Nearest Neighbors in Uncertain Graphs
Complex networks, such as biological, social, and communication networks, often entail uncertainty, and thus, can be modeled as probabilistic graphs. Similar to the problem of similarity search in standard graphs, a fundamental problem for probabilistic graphs is to efficiently answer k-nearest neighbor queries (k-NN), which is the problem of computing the k closest nodes to some specific node....
متن کاملProbabilistic Voronoi Diagrams for Probabilistic Moving Nearest Neighbor Queries
Article history: Received 9 November 2010 Received in revised form 4 February 2012 Accepted 6 February 2012 Available online 21 February 2012 A large spectrum of applications such as location based services and environmental monitoring demand efficient query processing on uncertain databases. In this paper, we propose the probabilistic Voronoi diagram (PVD) for processing moving nearest neighbo...
متن کاملEfficient Query Processing Techniques in Uncertain Databases
Query processing on uncertain data has become increasingly important in many real-world applications. In this paper, we present our works on formulating and tackling three important queries in uncertain databases, that is, probabilistic group nearest neighbor (PGNN), probabilistic reverse skyline (PRSQ), and probabilistic reverse nearest neighbor (PRNN) queries.
متن کاملProbabilistic Nearest Neighbor Queries on Uncertain Moving Object Trajectories
Nearest neighbor (NN) queries in trajectory databases have received significant attention in the past, due to their application in spatiotemporal data analysis. Recent work has considered the realistic case where the trajectories are uncertain; however, only simple uncertainty models have been proposed, which do not allow for accurate probabilistic search. In this paper, we fill this gap by add...
متن کاملMoving Convolutions and Continuous Probabilistic Nearest-Neighbor Queries for Uncertain Trajectories
This report presents our solution to the problem of processing continuous Nearest Neighbor (NN) queries for moving objects trajectories when the exact position of a given object at a particular time instant is not known, but is bounded by an uncertainty region. As has already been observed in the literature, the answers to continuous NN-queries in spatiotemporal settings are time parameterized ...
متن کامل